Open Source Bayesian Models. 3. Composite Models for Prediction of Binned Responses

نویسندگان

  • Alex M. Clark
  • Krishna Dole
  • Sean Ekins
چکیده

Bayesian models constructed from structure-derived fingerprints have been a popular and useful method for drug discovery research when applied to bioactivity measurements that can be effectively classified as active or inactive. The results can be used to rank candidate structures according to their probability of activity, and this ranking benefits from the high degree of interpretability when structure-based fingerprints are used, making the results chemically intuitive. Besides selecting an activity threshold, building a Bayesian model is fast and requires few or no parameters or user intervention. The method also does not suffer from such acute overtraining problems as quantitative structure-activity relationships or quantitative structure-property relationships (QSAR/QSPR). This makes it an approach highly suitable for automated workflows that are independent of user expertise or prior knowledge of the training data. We now describe a new method for creating a composite group of Bayesian models to extend the method to work with multiple states, rather than just binary. Incoming activities are divided into bins, each covering a mutually exclusive range of activities. For each of these bins, a Bayesian model is created to model whether or not the compound belongs in the bin. Analyzing putative molecules using the composite model involves making a prediction for each bin and examining the relative likelihood for each assignment, for example, highest value wins. The method has been evaluated on a collection of hundreds of data sets extracted from ChEMBL v20 and validated data sets for ADME/Tox and bioactivity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Two-Sample Prediction with Progressively Type-II Censored Data for Some Lifetime Models

Prediction on the basis of censored data is very important topic in many fields including medical and engineering sciences. In this paper, based on progressive Type-II right censoring scheme, we will discuss Bayesian two-sample prediction. A general form for lifetime model including some well known and useful models such asWeibull and Pareto is considered for obtaining prediction bounds ...

متن کامل

Non-linear Bayesian prediction of generalized order statistics for liftime models

In this paper, we obtain  Bayesian prediction intervals as well as Bayes predictive estimators under square error loss for generalized order statistics when the distribution of the underlying population belongs to a family which includes several important distributions.

متن کامل

Analytical Prediction of Indentation and Low-Velocity Impact Responses of Fully Backed Composite Sandwich Plates

In this paper, static indentation and low velocity impact responses of a fully backed composite sandwich plate subjected to a rigid flat-ended cylindrical indenter/impactor are analytically investigated. The analysis is nonlinear due to nonlinear strain-displacement relation. In contrast to the existed analytical models for the indentation of composite sandwich plates, the stacking sequence of ...

متن کامل

The Anatomy of DSGE Models with Banking Industry for Iran's Economy

The recent financial crisis has raised several questions with respect to the financial institutions and banking industry. Hence, over the last decade the Iranian banking industry has undergone many substantial changes, such as liberalization, government regulation and technological advances. What impacts do these changes have on the policy instruments? We have answered this question in this stu...

متن کامل

Analysis of Hierarchical Bayesian Models for Large Space Time Data of the Housing Prices in Tehran

Housing price data is correlated to their location in different neighborhoods and their correlation is type of spatial (location). The price of housing is varius in different months, so they also have a time correlation. Spatio-temporal models are used to analyze this type of the data. An important purpose of reviewing this type of the data is to fit a suitable model for the spatial-temporal an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 56 2  شماره 

صفحات  -

تاریخ انتشار 2016